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1 Fast detection of conrimunication patterns in distributed executions 1 
Thonnas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 
Studies on Collaborative research 

Publisher: IBM Press 

Full text available: g pdf(4.21 MB) Additional Information: full citation , abstract , references , index ternns 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the execution 
of the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not 
provide the user with the desired overview of the application. In our experience, such tools 
display repeated occurrences of non-trivial commun ... 



Efficient lo g ic variables for distributed computing 

Self Haridi, Peter Van Roy, Per Brand, Michael Mehl, Ralf Scheidhauer, Gert Smolka 
May 1999 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 21 Issue 3 
Publisher: ACM Press 

r- .. . ^ u. 0k ^r/iz-7o -DC i/Dx Additional Information: full citation , abstract , references , citings , index 

Full text available: to pdf(572.35 KB) ^ : 

1^=*^ terms 

We define a practical algorithm for distrubuted rational tree unification and prove its 
correctness In both the off-line and on-line cases. We derive the distributed algorithm from 
a centralized one, showing clearly the trade-offs between local and distributed execution. 
The algorithm is used to realize logic variables in the Mozart Programming System, which 
implements the Oz language (see http://www/mozart-oz.org). Oz appears to the 
programmer as a concurrent object-oriented language with ... 



Keywords: Mozart, Oz, distributed algorithms 



Real-time shading 

Marc Olano, Kurt Akeley, John C. Hart, Wolfgang Heidrich, Michael McCool, Jason L. Mitchell, 
Randi Rost 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^ pdf(7.39 MB) Additional Information: full citation , abstract 

Real-time procedural shading was once seen as a distant dream. When the first version of 
this course was offered four years ago, real -time shading was possible, but only with one- 
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of-a-kind hardware or by combining the effects of tens to hundreds of rendering passes. 
Today, almost every new computer comes with graphics hardware capable of interactively 
executing shaders of thousands to tens of thousands of instructions. This course has been 
redesigned to address today's real-time shading capabili ... 

4 GPGPU: general purpose computation on gra phics hardware |jj| 
David Luebke, Mark Harris, Jens Kruger, Tim Purcell, Naga Govindaraju, Ian Buck, Cliff 
Woolley, Aaron Lefohn 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes 
SIGGRAPH '04 

Publisher: ACM Press 

Full text available: ^ pdf(63.03 MB) Additional Information: full citation , abstract 

The graphics processor (GPU) on today's commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating 
point precision. High level languages have emerged for graphics hardware, making this 
computational power accessible. Architecturally, GPUs are highly parallel s ... 

Verification techniques for cache coherence protocols B 
Fong Pong, Michel Dubois 

March 1997 ACM Computing Surveys (CSUR), Volume 29 issue i 
Publisher: ACM Press 

i-iii ^ -I ui Alt ^f/^ocK/(D\ Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdfn.25MB) ^ ^ 

terms 

In this article we present a comprehensive survey of various approaches for the 
verification of cache coherence protocols based on state enumeration, (symbolic model 
checking, and symbolic state models. Since these techniques search the state space of the 
protocol exhaustively, the amount of memory required to manipulate that state 
information and the verification time grow very fast with the number of processors and the 
complexity of the protocol mechanism ... 

Keywords: cache coherence, finite state machine, protocol verification, shared-memory 
multiprocessors, state representation and expansion 



6 Simplify: a theorenn prover for program checking 

David Detlefs, Greg Nelson, James B. Saxe 
^ May 2005 Journal of the ACM (JACM), Volume 52 issue 3 

Publisher: ACM Press 

Full text available: ^ pdf(1 .93 M B) Additional Information: full citation , abstract , references , index terms 

This article provides a detailed description of the automatic theorem prover Simplify, 
which is the proof engine of the Extended Static Checkers ESC/Java and ESC/Modula-3. 
Simplify uses the Nelson--Oppen method to combine decision procedures for several 
important theories, and also employs a matcher to reason about quantifiers. Instead of 
conventional matching in a term DAG, Simplify matches up to equivalence in an E-graph, 
which detects many relevant pattern instances that would be missed by th ... 

Keywords: Theorem proving, decision procedures, program checking 




7 Run-time adaptation in river B 
Remzi H. Arpaci-Dusseau 

February 2003 ACM Transactions on Computer Systems (TOCS), Volume 21 issue 1 
Publisher: ACM Press 

Full text available: Q pdf(849.04 KB) Additional Information: full citation , abstract , references , index terms 
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We present the design, implementation, and evaluation of run-time adaptation within the 
River dataflow programming environment. The goal of the River system is to provide 
adaptive mechanisms that allow database query-processing applications to cope with 
performance variations that are common in cluster platforms. We describe the system and 
its basic mechanisms, and carefully evaluate those mechanisms and their effectiveness. In 
our analysis, we answer four previously unanswered and important que ... 

Keywords: Performance availability, clusters, parallel I/O, performance faults, robust 
performance, run-time adaptation 



S ystem-level power optimization: techniques and tools 
Luca Benini, Giovanni de Micheli 

April 2Q00 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 5 Issue 2 
Publisher: ACM Press 

I- II * * I ui 0t ^f/ooc oo i^D\ Additional Information: full citation , abstract , references , citing s, index 

Full text available: TO pdf(385.22 KB) ^ 

^^^i-^-^ terms 

This tutorial surveys design methods for energy-efficient system-level design. We consider 
electronic sytems consisting of a hardware platform and software layers. We consider the 
three major constituents of hardware that consume energy, namely computation, 
communication, and storage units, and we review methods of reducing their energy 
consumption. We also study models for analyzing the energy cost of software, and 
methods for energy-efficient software design and compilation. This survery ... 

Toward a logical/physical theory of spreadsheet modeling 
Tomas Isakowitz, Shimon Schocken, Henry C. Lucas 

January 1995 ACM Transactions on Information Systems (TOIS), Votume 13 issue i 
Publisher: ACM Press 

•. u. 0 ^f/o -ro n/iov Additional Information: full citation , abstract , references , citin gs, index 

Full text available: 1T1 pdf(2.76 MB) ^ : ^ 

^ terms , review 

In spite of the increasing sophistication and power of commercial spreadsheet packages, 
we still lack a formal theory or a methodology to support the construction and 
maintenance of spreadsheet models. Using a dual logical/physical perspective, we identify 
four principal components that characterize any spread sheet model: schema, data, 
editorial, and binding. We present a factoring algorithm for identifying and extracting 
these components ... 

Keywords: model management 



10 HFS: a performance-oriented flexible file system based on building-block 

^ compositions 

^ Orran Krieger, Michael Stumm 

August 1997 ACM Transactions on Computer Systems (TOCS), Volume 15 issue 3 

Publisher: ACM Press 

-I ui 0t ^*/ooo Q-7 i^D\ Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(383.87 KB) 

terms , review 

The Hurricane File System (HFS) is designed for (potentially large-scale) shared-memory 
multiprocessors. Its architecture is based on the principle that, in order to maximize 
performance for applications with diverse requirements, a file system must support a wide 
variety of file structures, file system policies, and I/O interfaces. Files in HFS are 
implemented using simple building blocks composed in potentially complex ways. This 
approach yields great flexibility, allowing an application ... 

Keywords: customization, data partitioning, data replication, flexibility, parallel 
computing, parallel file system 
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1'' Symbolic bounds analysis of pointers, array indices, and accessed memory regions Q 
Radu Rugina, Martin C. Rinard 

March 2005 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 27 Issue 2 
Publisher: ACM Press 

Full text available: ^ pdf(490.56 KB) Additional Information: full citation , abstract , references , index terms 

This article presents a novel framework for the symbolic bounds analysis of pointers, array 
Indices, and accessed memory regions. Our framework formulates each analysis problem 
as a system of inequality constraints between symbolic bound polynomials. It then reduces 
the constraint system to a linear program. The solution to the linear program provides 
symbolic lower and upper bounds for the values of pointer and array index variables and 
for the regions of memory that each statement and procedur ... 

Keywords: Symbolic analysis, parallelization, static race detection 




^2 Cloning-based context-sensitive pointer alias analysis using binary decision dia g rams 
John Whaley, Monica S. Lam 

June 2004 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2004 conference 
on Programming language design and implementation PLDI '04, Volume 39 
Issue 6 
Publisher: ACM Press 

Full text available* Ddf(277 87 KB) A^^'^'O"^' Information: full citation , abstract , references , citings , index 
' T^^l"^ * terms 

This paper presents the first scalable context-sensitive, inclusion-based pointer alias 
analysis for Java programs. Our approach to context sensitivity is to create a clone of a 
method for every context of interest, and run a context-insensitive algorithm over the 
expanded call graph to get context-sensitive results. For precision, we generate a clone for 
every acyclic path through a program's call graph, treating methods in a strongly 
connected component as a single node. Normally ... 

Keywords: Datalog, Java, binary decision diagrams, cloning, context-sensitive, inclusion- 
based, logic programming, pointer analysis, program analysis, scalable 



Coyote: a system for constructing fine-grain configurable communication services 
Nina T. Bhatti, Matti A, Hiltunen, Richard D. Schlichting, Wanda Chiu 
November 1998 ACM Transactions on Computer Systems (TOCS), volume 16 issue 4 

Publisher: ACM Press 

Full text available* ^ df(290 21 KB) ^^^^'^'onal Information: full citation , abstract , references , citings, index 

terms 

Communication-oriented abstractions such as atomic multicast, group RPC, and protocols 
for location-independent mobile computing can simplify the development of complex 
applications built on distributed systems. This article describes Coyote, a system that 
supports the construction of highly modular and configurable versions of such 
abstractions. Coyote extends the notion of protocol objects and hierarchical composition 
found in existing systems with support for finer-grain microprotocol ob ... 

Keywords: x-kernal, configurable sevices, customization, event handlers, event-driven 
execution, membership, microprotocols, mobile computing, modularity, multicast, 
protocols, remote procedure call 



''4 Tutorial: Compiling concurrent languages for sequential processors 
Stephen A. Edwards 

April 2003 ACM Transactions on Design Automation of Electronic Systems (TODAES), 

Volume 8 Issue 2 
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Publisher: ACM Press 

Fu!l text available* pdf(771.65 KB) Additional Information: full citation , abstract , references , citings , index 
* terms , review 

Embedded systems often include a traditional processor capable of executing sequential 
code, but both control and data-dominated tasks are often more naturally expressed using 
one of the many domain-specific concurrent specification languages. This article surveys a 
variety of techniques for translating these concurrent specifications into sequential code. 
The techniques address compiling a wide variety of languages, ranging from dataflow to 
Petri nets. Each uses a different method, to some degr ... 

Keywords: Compilation, Esterel, Lustre, Petri nets, Verilog, code generation, 
communication, concurrency, dataflow, discrete-event, partial evaluation, sequential 
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The Vesta parallel file system is designed to provide parallel file access to application 
programs running on multicomputers with parallel I/O subsystems. Vesta uses a new 
abstraction of files: a file is not a sequence of bytes, but rather it can be partitioned into 
multiple disjoint sequences that are accessed in parallel. The partitioning— which can also 
be changed dynamically— reduces the need for synchronization and coordination during 
the access. Some. control over the layout ... 

Keywords: data partitioning, parallel computing, parallel file system 
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Cats (Concurrency Analysis Tool Suite) is designed to satisfy several criteria: it must 
analyze implementation-level Ada source code and check user-specified conditions 
associated with program source code; it must be modularized in a fashion that supports 
flexible composition with other tool components, including integration with a variety of 
testing and analysis techniques; and its performance and capacity must be sufficient for 
analysis of real application programs. Meeting these objectiv ... 

Keywords: Ada, concurrency, software development environments, static analysis, tool 
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Single-language runtime systems, in the form of Java virtual machines, are widely 
deployed platforms for executing untrusted mobile code. These runtimes provide some of 
the features that operating systems provide: interapplication memory protection and basic 
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system services. They do not, however, provide the ability to isolate applications from 
each other. Neither do they provide the ability to limit the resource consumption of 
applications. Consequently, the performance of current systems degra ... 

Keywords: Robustness, garbage collection, isolation, language runtimes, resource 
management, termination, virtual machines 
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SLG resolution uses tabling to evaluate nonfloundering normal logic pr ograms according 
to the well-founded sennantics. The SLG-WAM, which fornns the engine of the XSB system, 
can compute in-memory recursive queries an order of magnltute faster than current 
deductive databases. At the same time, the SLG-WAM tightly intergrates Prolog code with 
tabled SLG code, and executes Prolog code with minimal overhead compared to the WAM. 
As a result, the SLG-WAM brings to logic progr ... 

Keywords: SLG, WAM, memoing, prolog, stratification theories, tabling 
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We argue that intensional object identity in object-oriented programnning languages and 
databases is best defined operationally by side-effect sennantics. A corollary is that 
"functional" objects have extenslonal semantics. This model of object Identity, which is 
analogous to the normal forms of relational algebra, provides cleaner semantics for the 
value-transmission operations and built-in primitive equality predicate of a programming 
language, and eliminates the confusion surrounding "ca ... 
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Pointer-chasing applications tend to traverse composite data structures consisting of 
multiple independent pointer chains. While the traversal of any single pointer chain leads 
to the serialization of memory operations, the traversal of independent pointer chains 
provides a source of memory parallelism. This article investigates exploiting such 
intercfiain memory paraileiism for the purpose of memory latency tolerance, using a 
technique called muiti—cliain prefetciiing. Previous work 
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